Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Methods Mol Biol ; 2426: 315-331, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36308695

RESUMO

Adaptive PENSE is a method that can be used to build models for predicting clinical outcomes from a small subset of a potentially large number of candidate proteins. Adaptive PENSE is designed to give reliable results under two common challenges often encountered in these kinds of studies: (1) the number of samples with known clinical outcome and proteomic data is small, while the number of candidate proteins is large and/or (2) proteomic data and the clinical outcome measurements suffer from data quality issues in a small fraction of samples. Even in the presence of these challenges, adaptive PENSE reliably identifies proteins relevant for prediction and estimates accurate predictive models. Adaptive PENSE is designed to be resilient to data quality issues in up to 50% of samples. Almost half of the samples could have aberrant values in the measured protein levels and clinical outcome values without causing severe detrimental effects to the estimated predictive model. The method is implemented as an R package and supports the user in the model selection process by automating most steps and providing diagnostic visualizations to guide the user. Users can choose among several predictive models to select the model with high prediction accuracy and an appropriate number of selected proteins.


Assuntos
Proteínas , Proteômica , Proteínas/genética , Projetos de Pesquisa
2.
Stat Med ; 41(18): 3511-3526, 2022 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-35567357

RESUMO

The continuous evolution of metabolomics over the past two decades has stimulated the search for metabolic biomarkers of many diseases. Metabolomic data measured from urinary samples can provide rich information of the biological events triggered by organ rejection in pediatric kidney transplant recipients. With additional validation, metabolic markers can be used to build clinically useful diagnostic tools. However, there are many methodological steps ranging from data processing to modeling that can influence the performance of the resulting metabolomic classifiers. In this study we focus on the comparison of various classification methods that can handle the complex structure of metabolomic data, including regularized classifiers, partial least squares discriminant analysis, and nonlinear classification models. We also examine the effectiveness of a physiological normalization technique widely used in the clinical and biochemical literature but not extensively analyzed and compared in urine metabolomic studies. While the main objective of this work is to interrogate metabolomic data of pediatric kidney transplant recipients to improve the diagnosis of T cell-mediated rejection (TCMR), we also analyze three independent datasets from other disease conditions to investigate the generalizability of our findings.


Assuntos
Transplante de Rim , Biomarcadores/urina , Criança , Análise Discriminante , Humanos , Análise dos Mínimos Quadrados , Metabolômica/métodos
3.
EBioMedicine ; 75: 103776, 2022 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-35027333

RESUMO

BACKGROUND: Inter-individual variability during sepsis limits appropriate triage of patients. Identifying, at first clinical presentation, gene expression signatures that predict subsequent severity will allow clinicians to identify the most at-risk groups of patients and enable appropriate antibiotic use. METHODS: Blood RNA-Seq and clinical data were collected from 348 patients in four emergency rooms (ER) and one intensive-care-unit (ICU), and 44 healthy controls. Gene expression profiles were analyzed using machine learning and data mining to identify clinically relevant gene signatures reflecting disease severity, organ dysfunction, mortality, and specific endotypes/mechanisms. FINDINGS: Gene expression signatures were obtained that predicted severity/organ dysfunction and mortality in both ER and ICU patients with accuracy/AUC of 77-80%. Network analysis revealed these signatures formed a coherent biological program, with specific but overlapping mechanisms/pathways. Given the heterogeneity of sepsis, we asked if patients could be assorted into discrete groups with distinct mechanisms (endotypes) and varying severity. Patients with early sepsis could be stratified into five distinct and novel mechanistic endotypes, named Neutrophilic-Suppressive/NPS, Inflammatory/INF, Innate-Host-Defense/IHD, Interferon/IFN, and Adaptive/ADA, each based on ∼200 unique gene expression differences, and distinct pathways/mechanisms (e.g., IL6/STAT3 in NPS). Endotypes had varying overall severity with two severe (NPS/INF) and one relatively benign (ADA) groupings, consistent with reanalysis of previous endotype studies. A 40 gene-classification tool (accuracy=96%) and several gene-pairs (accuracy=89-97%) accurately predicted endotype status in both ER and ICU validation cohorts. INTERPRETATION: The severity and endotype signatures indicate that distinct immune signatures precede the onset of severe sepsis and lethality, providing a method to triage early sepsis patients.


Assuntos
Sepse , Cuidados Críticos , Humanos , Unidades de Terapia Intensiva , Sepse/diagnóstico , Sepse/genética , Índice de Gravidade de Doença , Transcriptoma
4.
Nat Med ; 26(4): 577-588, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-32094924

RESUMO

Transmembrane protein 30A (TMEM30A) maintains the asymmetric distribution of phosphatidylserine, an integral component of the cell membrane and 'eat-me' signal recognized by macrophages. Integrative genomic and transcriptomic analysis of diffuse large B-cell lymphoma (DLBCL) from the British Columbia population-based registry uncovered recurrent biallelic TMEM30A loss-of-function mutations, which were associated with a favorable outcome and uniquely observed in DLBCL. Using TMEM30A-knockout systems, increased accumulation of chemotherapy drugs was observed in TMEM30A-knockout cell lines and TMEM30A-mutated primary cells, explaining the improved treatment outcome. Furthermore, we found increased tumor-associated macrophages and an enhanced effect of anti-CD47 blockade limiting tumor growth in TMEM30A-knockout models. By contrast, we show that TMEM30A loss-of-function increases B-cell signaling following antigen stimulation-a mechanism conferring selective advantage during B-cell lymphoma development. Our data highlight a multifaceted role for TMEM30A in B-cell lymphomagenesis, and characterize intrinsic and extrinsic vulnerabilities of cancer cells that can be therapeutically exploited.


Assuntos
Transformação Celular Neoplásica/genética , Mutação com Perda de Função , Linfoma Difuso de Grandes Células B/genética , Linfoma Difuso de Grandes Células B/terapia , Proteínas de Membrana/genética , Terapia de Alvo Molecular , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Animais , Colúmbia Britânica/epidemiologia , Células Cultivadas , Estudos de Coortes , Feminino , Predisposição Genética para Doença , Células HEK293 , Humanos , Células Jurkat , Mutação com Perda de Função/genética , Linfoma Difuso de Grandes Células B/epidemiologia , Linfoma Difuso de Grandes Células B/patologia , Masculino , Camundongos , Camundongos Endogâmicos BALB C , Camundongos Endogâmicos NOD , Camundongos SCID , Camundongos Transgênicos , Pessoa de Meia-Idade , Terapia de Alvo Molecular/métodos , Terapia de Alvo Molecular/tendências , Adulto Jovem
5.
Proteomics Clin Appl ; 13(4): e1700111, 2019 07.
Artigo em Inglês | MEDLINE | ID: mdl-30632678

RESUMO

PURPOSE: A highly-multiplexed LC-ESI-multiple reaction monitoring-MS-based assay is developed for the identification of coronary artery disease (CAD) biomarkers in human plasma. EXPERIMENTAL DESIGN: The assay is used to measure 107 stable isotope labeled peptide standards and native peptides from 64 putative biomarkers of cardiovascular diseases in tryptic digests of plasma from subjects with (n = 70) and without (n = 45) angiographic evidence of CAD and no subsequent cardiovascular mortality during follow-up. RESULTS: Extensive computational and statistical analysis reveals six plasma proteins associated with CAD, namely apolipoprotein CII, C reactive protein, CD5 antigen-like, fibronectin, inter alpha trypsin inhibitor heavy chain H1, and protein S. The identified proteins are combined into a LASSO-logistic score with high classification performance (cross-validated area under the curve = 0.74). When combined with a separate score computed from markers currently used in the clinic with similar performance, the area under the receiver operating curve increases to 0.84. Similar results are observed in an independent set of subjects (n = 87). CONCLUSIONS AND CLINICAL RELEVANCE: If externally validated, the assay and identified biomarkers can improve CAD risk stratification.


Assuntos
Proteínas Sanguíneas/metabolismo , Doença da Artéria Coronariana/sangue , Peptídeos/sangue , Proteômica , Cromatografia Líquida , Feminino , Seguimentos , Humanos , Masculino , Espectrometria de Massas , Pessoa de Meia-Idade
6.
Am J Respir Crit Care Med ; 197(4): 450-462, 2018 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-29087730

RESUMO

RATIONALE: The allergen inhalation challenge is used in clinical trials to test the efficacy of new treatments in attenuating the late-phase asthmatic response (LAR) and associated airway inflammation in subjects with allergic asthma. However, not all subjects with allergic asthma develop the LAR after allergen inhalation. Blood-based transcriptional biomarkers that can identify such individuals may help in subject recruitment for clinical trials as well as provide novel molecular insights. OBJECTIVES: To identify blood-based transcriptional biomarker panels that can predict an individual's response to allergen inhalation challenge. METHODS: We applied RNA sequencing to total RNA from whole blood (n = 36) collected before and after allergen challenge and generated both genome-guided and de novo datasets: genes, gene-isoforms (University of California, Santa Cruz, UCSC Genome Browser), Ensembl, and Trinity. Candidate biomarker panels were validated using the NanoString platform in an independent cohort of 33 subjects. MEASUREMENTS AND MAIN RESULTS: The Trinity biomarker panel consisting of known and novel biomarker transcripts had an area under the receiver operating characteristic curve of greater than 0.70 in both the discovery and validation cohorts. The Trinity biomarker panel was useful in predicting the response of subjects that elicited different responses (accuracy between 0.65 and 0.71) and subjects that elicit a dual response (accuracy between 0.70 and 0.75) upon repeated allergen inhalation challenges. CONCLUSIONS: Interestingly, the biomarker panel containing novel transcripts successfully validated compared with panels with known, well-characterized genes. These biomarker-blood tests may be used to identify subjects with asthma who develop the LAR, and may also represent members of novel molecular mechanisms that can be targeted for therapy.


Assuntos
Asma/sangue , Asma/diagnóstico , Testes de Provocação Brônquica/métodos , Perfilação da Expressão Gênica/métodos , Adulto , Asma/genética , Biomarcadores/sangue , Feminino , Humanos , Masculino , Valor Preditivo dos Testes , Adulto Jovem
7.
PLoS One ; 12(5): e0177569, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28562641

RESUMO

The quantitation of proteins using shotgun proteomics has gained popularity in the last decades, simplifying sample handling procedures, removing extensive protein separation steps and achieving a relatively high throughput readout. The process starts with the digestion of the protein mixture into peptides, which are then separated by liquid chromatography and sequenced by tandem mass spectrometry (MS/MS). At the end of the workflow, recovering the identity of the proteins originally present in the sample is often a difficult and ambiguous process, because more than one protein identifier may match a set of peptides identified from the MS/MS spectra. To address this identification problem, many MS/MS data processing software tools combine all plausible protein identifiers matching a common set of peptides into a protein group. However, this solution introduces new challenges in studies with multiple experimental runs, which can be characterized by three main factors: i) protein groups' identifiers are local, i.e., they vary run to run, ii) the composition of each group may change across runs, and iii) the supporting evidence of proteins within each group may also change across runs. Since in general there is no conclusive evidence about the absence of proteins in the groups, protein groups need to be linked across different runs in subsequent statistical analyses. We propose an algorithm, called Protein Group Code Algorithm (PGCA), to link groups from multiple experimental runs by forming global protein groups from connected local groups. The algorithm is computationally inexpensive and enables the connection and analysis of lists of protein groups across runs needed in biomarkers studies. We illustrate the identification problem and the stability of the PGCA mapping using 65 iTRAQ experimental runs. Further, we use two biomarker studies to show how PGCA enables the discovery of relevant candidate protein group markers with similar but non-identical compositions in different runs.


Assuntos
Algoritmos , Proteínas/química , Espectrometria de Massas em Tandem/métodos , Sequência de Aminoácidos , Biomarcadores , Transplante de Coração , Humanos , Distrofias Musculares/metabolismo , Proteômica , Homologia de Sequência de Aminoácidos
8.
J Proteomics ; 118: 2-11, 2015 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-25753122

RESUMO

Multiple sclerosis (MS) is associated with chronic degeneration of the central nervous system and may cause permanent neurological problems and considerable disability. While its causes remain unclear, its extensive phenotypic variability makes its prognosis and treatment difficult. The identification of serum proteomic biomarkers of MS progression could further our understanding of the molecular mechanisms related to MS disease processes. In the current study, we used isobaric tagging for relative and absolute protein quantification (iTRAQ) methodology and advanced multivariate statistical analysis to quantify and identify potential serum biomarker proteins of MS progression. We identified a panel of 11 proteins and combined them into a classifier that best classified samples into the two disease groups. The estimated area under the receiver operating curve of this classifier was 0.88 (p-value=0.017), with 86% sensitivity and specificity. The identified proteins encompassed processes related to inflammation, opsonization, and complement activation. Results from this study are in particular valuable to design a targeted Multiple Reaction Monitoring mass spectrometry based (MRM-MS) assay to conduct an external validation in an independent and larger cohort of patients. Validated biomarkers may result in the development of a minimally-invasive tool to monitor MS progression and complement current clinical practices. BIOLOGICAL SIGNIFICANCE: A hallmark of multiple sclerosis is the unpredictable disease course (progression). There are currently no clinically useful biomarkers of MS disease progression; most work has focused on the analysis of CSF, which requires an invasive procedure. Here, we explore the potential of proteomics to identify panels of serum biomarkers of disease progression in MS. By comparing the protein signatures of two challenging to obtain, but well-defined, MS phenotypic groups at the extremes of progression (benign and aggressive cases of MS), we identified proteins that encompass processes related to inflammation, opsonization, and complement activation. Findings require validation, but are an important step on the pathway to clinically useful biomarker discovery. This article is part of a Special Issue entitled: Protein dynamics in health and disease. Guest Editors: Pierre Thibault and Anne-Claude Gingras.


Assuntos
Proteínas Sanguíneas/metabolismo , Progressão da Doença , Esclerose Múltipla/sangue , Proteoma/metabolismo , Proteômica , Adulto , Biomarcadores/sangue , Feminino , Humanos , Masculino
9.
Eur J Heart Fail ; 16(5): 551-9, 2014 May.
Artigo em Inglês | MEDLINE | ID: mdl-24574204

RESUMO

AIMS: Chronic heart failure is a costly epidemic that affects up to 2% of people in developed countries. The purpose of this study was to discover novel blood proteomic biomarker signatures of recovered heart function that could lead to more effective heart failure patient management by both primary care and specialty physicians. METHODS AND RESULTS: The discovery cohort included 41 heart transplant patients and 20 healthy individuals. Plasma levels of 138 proteins were detected in at least 75% of these subjects by iTRAQ mass spectrometry. Eighteen proteins were identified that had (i) differential levels between pre-transplant patients with end-stage heart failure and healthy individuals; and (ii) levels that returned to normal by 1 month post-transplant in patients with stable heart function after transplantation. Seventeen of the 18 markers were validated by multiple reaction monitoring mass spectrometry in a cohort of 39 heart failure patients treated with drug therapy, of which 30 had recovered heart function and 9 had not. This 17-protein biomarker panel had 93% sensitivity and 89% specificity, while the RAMP® NT-proBNP assay had the same specificity but 80% sensitivity. Performance further improved when the panel was combined with NT-proBNP, yielding a net reclassification index relative to NT-proBNP of 0.28. CONCLUSIONS: We have identified potential blood biomarkers of recovered heart function by harnessing data from transplant patients. These biomarkers can lead to the development of an inexpensive protein-based blood test that could be used by physicians to monitor response to therapy in heart failure, resulting in more personalized, front-line heart failure patient management.


Assuntos
Proteínas Sanguíneas , Fármacos Cardiovasculares/uso terapêutico , Insuficiência Cardíaca , Transplante de Coração/métodos , Adulto , Idoso , Biomarcadores/análise , Biomarcadores/sangue , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/classificação , Interpretação Estatística de Dados , Monitoramento de Medicamentos/métodos , Feminino , Insuficiência Cardíaca/sangue , Insuficiência Cardíaca/diagnóstico , Insuficiência Cardíaca/tratamento farmacológico , Insuficiência Cardíaca/cirurgia , Humanos , Masculino , Pessoa de Meia-Idade , Peptídeo Natriurético Encefálico/sangue , Avaliação de Resultados em Cuidados de Saúde , Fragmentos de Peptídeos/sangue , Assistência Perioperatória/métodos , Recuperação de Função Fisiológica/fisiologia , Projetos de Pesquisa , Sensibilidade e Especificidade
10.
PLoS Comput Biol ; 9(4): e1002963, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23592955

RESUMO

Recent technical advances in the field of quantitative proteomics have stimulated a large number of biomarker discovery studies of various diseases, providing avenues for new treatments and diagnostics. However, inherent challenges have limited the successful translation of candidate biomarkers into clinical use, thus highlighting the need for a robust analytical methodology to transition from biomarker discovery to clinical implementation. We have developed an end-to-end computational proteomic pipeline for biomarkers studies. At the discovery stage, the pipeline emphasizes different aspects of experimental design, appropriate statistical methodologies, and quality assessment of results. At the validation stage, the pipeline focuses on the migration of the results to a platform appropriate for external validation, and the development of a classifier score based on corroborated protein biomarkers. At the last stage towards clinical implementation, the main aims are to develop and validate an assay suitable for clinical deployment, and to calibrate the biomarker classifier using the developed assay. The proposed pipeline was applied to a biomarker study in cardiac transplantation aimed at developing a minimally invasive clinical test to monitor acute rejection. Starting with an untargeted screening of the human plasma proteome, five candidate biomarker proteins were identified. Rejection-regulated proteins reflect cellular and humoral immune responses, acute phase inflammatory pathways, and lipid metabolism biological processes. A multiplex multiple reaction monitoring mass-spectrometry (MRM-MS) assay was developed for the five candidate biomarkers and validated by enzyme-linked immune-sorbent (ELISA) and immunonephelometric assays (INA). A classifier score based on corroborated proteins demonstrated that the developed MRM-MS assay provides an appropriate methodology for an external validation, which is still in progress. Plasma proteomic biomarkers of acute cardiac rejection may offer a relevant post-transplant monitoring tool to effectively guide clinical care. The proposed computational pipeline is highly applicable to a wide range of biomarker proteomic studies.


Assuntos
Biomarcadores/análise , Proteínas Sanguíneas/análise , Biologia Computacional/métodos , Transplante de Coração , Proteômica/métodos , Calibragem , Estudos de Coortes , Ensaio de Imunoadsorção Enzimática , Rejeição de Enxerto , Insuficiência Cardíaca/terapia , Humanos , Inflamação , Espectrometria de Massas , Proteoma/análise
11.
J Heart Lung Transplant ; 32(2): 259-65, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23265908

RESUMO

BACKGROUND: Acute rejection in cardiac transplant patients remains a contributory factor to limited survival of implanted hearts. Currently, there are no biomarkers in clinical use that can predict, at the time of transplantation, the likelihood of post-transplant acute cellular rejection. Such a development would be of great value in personalizing immunosuppressive treatment. METHODS: Recipient age, donor age, cold ischemic time, warm ischemic time, panel-reactive antibody, gender mismatch, blood type mismatch and human leukocyte antigens (HLA-A, -B and -DR) mismatch between recipients and donors were tested in 53 heart transplant patients for their power to predict post-transplant acute cellular rejection. Donor transplant biopsy and recipient pre-transplant blood were also examined for the presence of genomic biomarkers in 7 rejection and 11 non-rejection patients, using non-targeted data mining techniques. RESULTS: The biomarker based on the 8 clinical variables had an area under the receiver operating characteristic curve (AUC) of 0.53. The pre-transplant recipient blood gene-based panel did not yield better performance, but the donor heart tissue gene-based panel had an AUC = 0.78. A combination of 25 probe sets from the transplant donor biopsy and 18 probe sets from the pre-transplant recipient whole blood had an AUC = 0.90. Biologic pathways implicated include VEGF- and EGFR-signaling, and MAPK. CONCLUSIONS: Based on this study, the best predictive biomarker panel contains genes from recipient whole blood and donor myocardial tissue. This panel provides clinically relevant prediction power and, if validated, may personalize immunosuppressive treatment and rejection monitoring.


Assuntos
Expressão Gênica , Rejeição de Enxerto/epidemiologia , Transplante de Coração/imunologia , Adulto , Biomarcadores/análise , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Valor Preditivo dos Testes , Curva ROC , Medição de Risco , Sensibilidade e Especificidade
12.
Proteomics Clin Appl ; 6(9-10): 476-85, 2012 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22930592

RESUMO

PURPOSE: This proteomics study was designed to determine the utility of iTRAQ MALDI-TOF/TOF technology to compare plasma samples from carefully phenotyped mild, atopic asthma subjects undergoing allergen inhalation challenge. EXPERIMENTAL DESIGN: Eight adult subjects with mild, allergic asthma (four early responders (ERs) and four dual responders (DRs)) participated in the allergen inhalation challenge. Blood samples were collected prior to and 2 h after the inhalation challenge. Sixteen plasma samples (two per subject), technical replicates, and pooled controls were analyzed using iTRAQ. Technical validation was performed using LC-MRM/MS. Moderated robust regression was used to determine differentially expressed proteins. RESULTS: Although this study did not show significant differences between pre- and post-challenge samples, discriminant analysis indicated that certain proteins responded differentially to allergen challenge with respect to responder type. At pre-challenge, fibronectin was significantly elevated in DRs compared to ERs and remained significant in the multiple reaction monitoring validation. CONCLUSIONS AND CLINICAL RELEVANCE: This proof of principle demonstration has shown that iTRAQ can uncover differences in the human plasma proteome between two endotypes of asthma and merits further application of iTRAQ to larger cohorts of asthma and other respiratory diseases.


Assuntos
Asma/sangue , Proteômica , Administração por Inalação , Adulto , Alérgenos/imunologia , Asma/imunologia , Asma/patologia , Cromatografia Líquida de Alta Pressão , Análise Discriminante , Feminino , Fibronectinas/sangue , Humanos , Masculino , Pessoa de Meia-Idade , Proteoma/análise , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz , Adulto Jovem
14.
J Proteomics ; 75(12): 3514-28, 2012 Jun 27.
Artigo em Inglês | MEDLINE | ID: mdl-22146476

RESUMO

In this study we demonstrate the use of a multiplexed MRM-based assay to distinguish among normal (NL) and iron-metabolism disorder mouse models, particularly, iron-deficiency anemia (IDA), inflammation (INFL), and inflammation and anemia (INFL+IDA). Our initial panel of potential biomarkers was based on the analysis of 14 proteins expressed by candidate genes involved in iron transport and metabolism. Based on this study, we were able to identify a panel of 8 biomarker proteins: apolipoprotein A4 (APO4), transferrin, transferrin receptor 1, ceruloplasmin, haptoglobin, lactoferrin, hemopexin, and matrix metalloproteinase-8 (MMP8) that clearly distinguish among the normal and disease models. Within this set of proteins, transferrin showed the best individual classification accuracy over all samples (72%) and within the NL group (94%). Compared to the best single-protein biomarker, transferrin, the use of the composite 8-protein biomarker panel improved the classification accuracy from 94% to 100% in the NL group, from 50% to 72% in the INFL group, from 66% to 96% in the IDA group, and from 79% to 83% in the INFL+IDA group. Based on these findings, validation of the utility of this potentially important biomarker panel in human samples in an effort to differentiate IDA, inflammation, and combinations thereof, is now warranted. This article is part of a Special Section entitled: Understanding genome regulation and genetic diversity by mass spectrometry.


Assuntos
Anemia Ferropriva/sangue , Anemia Ferropriva/diagnóstico , Anemia/sangue , Anemia/diagnóstico , Proteínas Sanguíneas/análise , Inflamação/complicações , Espectrometria de Massas/métodos , Anemia/etiologia , Animais , Biomarcadores/sangue , Proteínas Sanguíneas/química , Diagnóstico Diferencial , Feminino , Inflamação/sangue , Inflamação/diagnóstico , Camundongos , Camundongos Endogâmicos C57BL , Mapeamento de Peptídeos/métodos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
15.
Bioinformatics ; 23(23): 3162-9, 2007 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-17933854

RESUMO

MOTIVATION: The process of producing microarray data involves multiple steps, some of which may suffer from technical problems and seriously damage the quality of the data. Thus, it is essential to identify those arrays with low quality. This article addresses two questions: (1) how to assess the quality of a microarray dataset using the measures provided in quality control (QC) reports; (2) how to identify possible sources of the quality problems. RESULTS: We propose a novel multivariate approach to evaluate the quality of an array that examines the 'Mahalanobis distance' of its quality attributes from those of other arrays. Thus, we call it Mahalanobis Distance Quality Control (MDQC) and examine different approaches of this method. MDQC flags problematic arrays based on the idea of outlier detection, i.e. it flags those arrays whose quality attributes jointly depart from those of the bulk of the data. Using two case studies, we show that a multivariate analysis gives substantially richer information than analyzing each parameter of the QC report in isolation. Moreover, once the QC report is produced, our quality assessment method is computationally inexpensive and the results can be easily visualized and interpreted. Finally, we show that computing these distances on subsets of the quality measures in the report may increase the method's ability to detect unusual arrays and helps to identify possible reasons of the quality problems. AVAILABILITY: The library to implement MDQC will soon be available from Bioconductor.


Assuntos
Algoritmos , Interpretação Estatística de Dados , Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Armazenamento e Recuperação da Informação/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Análise Multivariada , Controle de Qualidade , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...